Linguistic Categories as Basins of Curvature
Abstract
Linguistic grammars do a good job of elucidating the lexical categories and phrasal units of the languages they model. Certain recurrent connectionist networks can model similar data, but the abstract structure of the resulting representations is not easy to discern. Hierarchical clustering helps in this regard, but it often produces implausible clusters, and there seems to be no principled way of deciding which clusters are relevant. A more promising approach is to examine the curvature of the image of the hidden space in the output space. Because the relative frequency distributions of lexical items tend to be highly categorical, with probabilistic variation in only a few dimensions, they generally lie on the linear surfaces of the output space. Consequently, categories are associated with regions of low curvature. I show that category extraction based on this principle can produce a more revealing analysis than hierarchical clustering in certain cases.
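The abstract does not say how curvature is measured, so the following is only a minimal sketch of one possible discrete estimate, not the author's procedure. It assumes a sequence of output-space points (for example, network outputs along a path through hidden space) and uses the Menger curvature of each consecutive triple, which is the reciprocal of the radius of the circle through the three points. The function names `menger_curvature` and `path_curvature` are hypothetical.

```python
import numpy as np


def menger_curvature(p, q, r):
    """Return 1 / circumradius of the circle through three points.

    Works in any dimension; returns 0.0 for collinear or repeated points.
    """
    u, v = q - p, r - p
    # Twice the triangle area via Lagrange's identity (no cross product needed).
    gram = np.dot(u, u) * np.dot(v, v) - np.dot(u, v) ** 2
    twice_area = np.sqrt(max(gram, 0.0))
    sides = np.linalg.norm(q - p) * np.linalg.norm(r - q) * np.linalg.norm(r - p)
    # Menger curvature is 4 * area / (product of the three side lengths).
    return 0.0 if sides == 0.0 else 2.0 * twice_area / sides


def path_curvature(points):
    """Curvature estimate at each interior point of a sampled path."""
    points = np.asarray(points, dtype=float)
    return np.array([
        menger_curvature(points[i - 1], points[i], points[i + 1])
        for i in range(1, len(points) - 1)
    ])
```

Under this sketch, stretches of a path whose estimated curvature stays below some chosen threshold would be candidates for a single category, while sharp bends would mark transitions. The threshold and the way paths are sampled are left open here because the abstract does not specify them.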